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Abstract 

An instance of the tollbooth problem consists of an undirected network and a collection of single- 
minded customers, each of which is interested in purchasing a fixed path subject to an individual 
budget constraint. The objective is to assign a per-unit price to each edge in a way that maximizes 
the collective revenue obtained from all customers. The revenue generated by any customer is equal 
to the overall price of the edges in her desired path, when this cost falls within her budget; otherwise, 
that customer will not purchase any edge. 

Our main result is a deterministic algorithm for the tollbooth problem on trees whose approxi- 
mation ratio is (9(logm/loglogm), where m denotes the number of edges in the underlying graph. 
This finding improves on the currently best performance guarantees for trees, due to Elbassioni et 
al. (SAGT '09), as well as for paths (commonly known as the highway problem), due to Balcan and 
Blum (EC '06). An additional interesting consequence is a computational separation between toll- 
booth pricing on trees and the original prototype problem of single-minded unlimited supply pricing, 
under a plausible hardness hypothesis due to Demaine et al. (SODA '06). 
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1 Introduction 



An extensively-studied question in economics and operations management is that of pricing an assortment 
of products in a given market, trying to maximize revenue subject to a multitude of constraints. Somewhat 
informally, the inherent difficulty in such settings boils down to the obvious tension between two extremes: 
low prices attract more customers, while high prices generate greater revenues per purchase. Recently, 
the spotlights have been turned on computational challenges in pricing. What seems to be the driving 
force behind this line of research is an immense growth in the range of sources for acquiring costumer 
preferences data, which are now available as a result of the widespread use of the Internet. 

The tollbooth problem. One particular computational task that has received much recent attention is the 
tollbooth problem, which captures optimization-related aspects of pricing connection links in networks, 
e.g., setting prices for road use in a system of toll highways. Formally, an instance of this problem consists 
of an undirected graph G = (V,E) with m edges, which can be broadly interpreted as products with un- 
limited supply. An additional ingredient of the input is a collection C of n single-minded customers, each 
of which is interested in purchasing a fixed path subject to an individual budget constraint. Technically 
speaking, the demand attributes of customer i are represented by a pair (Pj, bj), where P, c G is the path 
she wishes to buy, and bj stands for her budget, namely, the maximum price she is willing to pay for that 
path. Any customer will buy a single unit of each edge in the desired path when its total cost falls within 
her budget; otherwise, she leaves without buying anything. With this setting in mind, the goal is to assign 
a per-unit price to each edge in a way that maximizes the overall revenue. More precisely, the objective is 
to compute a pricing scheme p : E —> R + that maximizes the total revenue over all customers, £" =1 R p (i). 
Here, R p (i) denotes the revenue obtained from customer i, which evaluates to Y>eeP, P( e ) when this cost 
does not exceed bj, or to 0, otherwise. 

Previous work. Guruswami et al. E3l seem to have been the first to study the tollbooth problem. Their 
main results in this context were to show that tollbooth pricing is APX-hard even when the underlying 
graph is a tree, and to devise exact dynamic-programming algorithms for the single-source variant on trees 
and for several other special cases. Additional hardness results were obtained by Briest and Krysta 0, 
who proved that even the seemingly-manageable setting of a simple path, commonly known as the high- 
way problem, is in fact weakly NP-hard. Elbassioni, Raman, Ray, and Sitters iTTBI extended this result by 
establishing strong NP-hardness. On the positive side, however, Balcan and Blum devised an 0(log m) 
approximation for the highway problem; this finding is incomparable with the quasi-PTAS developed by 
Elbassioni, Sitters, and Zhang Ifl4l later on. Finally, and very recently, Elbassioni et al. iTTBl proposed an 
0(log m) approximation for arbitrary trees. To conclude, approximating the tollbooth problem on trees, 
or even on simple paths, beyond the logarithmic threshold has remained an open research question. 

1.1 Our results 

The main result of this paper is a deterministic algorithm for the tollbooth problem on trees whose approx- 
imation ratio is 0(log m/ log log m), improving on the currently best performance guarantees for trees, as 
well as for paths, due to Elbassioni et al. Ifl3l . and to Balcan and Blum 0, respectively. Even though 
the quantitative magnitude of improvement is not that dramatic, our findings have additional interesting 
contributions: 

Conceptual. We identify a computational separation between tollbooth pricing on trees and the original 
prototype problem of single-minded unlimited supply pricing (see Section I1.2I ). The latter cannot be 
approximated within a factor smaller than £2(log n), under a plausible hardness hypothesis regarding the 
balanced bipartite independent set problem lTT2ll . whereas in the former the resulting factor is better by 
Q(loglogn) or more. Regarding the relation between m and n, we remark that an arbitrary instance of 
tollbooth pricing on trees can be reduced to one with m = 0(n). The general idea is that, when the number 
of edges is significantly larger than the number of customers, paths can be iteratively contracted (see, for 
instance, [13, Sec. 2.2]). 
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Technical. We believe that some of the algorithmic tools and analysis methods, illustrated in Sections [2] 
and[3j are of independent interest, and may be applicable in other settings as well. In particular, we have 
already derived new approximability bounds for graph orientation problems |[T8l by synthesizing ideas 
such as balanced decompositions, segment guessing, and randomization. 

1.2 Related work 

We proceed with a brief discussion on the single-minded unlimited supply pricing problem, from which 
the computational setting considered in this paper seems to have emerged. The input to the former prob- 
lem consists of m different products, with unlimited supply, and a collection of n single-minded customers, 
each of which is interested in purchasing a particular subset (or bundle) of products subject to an individ- 
ual budget constraint. The goal is to assign a per-unit price to each product in a way that maximizes the 
overall revenue, where again, a customer will buy a single unit of each product in her bundle only when 
its total cost fits within her budget. This problem was originally introduced by Guruswami et al. E3l . 
who demonstrated that the single-price policy, where all products are given identical prices, guarantees 
an approximation ratio of 0(logn + logra) with respect to the optimal revenue. Later on, Balcan, Blum 
and Mansour H extended this finding to the setting of customers with general valuation functions. From 
a hardness point of view, Demaine, Feige, Hajiaghayi, and Salavatipour |[T2l established several inap- 
proximability results under various complexity assumptions. In particular, they proved a lower bound of 
Q(log n), under a plausible hardness hypothesis regarding the balanced bipartite independent set problem. 

A concurrent line of research focused on computing approximate pricing schemes in terms of other 
problem parameters. For instance, when the number of different products is fixed, Hartline and Koltun Il24l 
showed that an FPTAS can be devised. Briest and Krysta Q suggested an 0(log B + log €) approximation, 
where B is the maximum number of requests per product and £ is the maximum size of any bundle, as well 
as a different algorithm, whose performance guarantee is 0{£ 2 ). Balcan and Blum improved on this 
result, to obtain a ratio of 0{£), and demonstrated that the vertex pricing problem (where £ = 2) can be 
approximated within a factor of 4. Vertex pricing was further studied in E71I261 . Recently, Cheung and 
Swamy ifTTTl design an LP-based algorithm for a more general model of revenue maximization in a limited 
supply scenario that implies an 0(log B) approximation for single-minded unlimited supply pricing. 

We note that revenue maximization, in various colors and flavors, has received a great deal of recent 
attention in the computer science and operations research communities. Therefore, it is beyond the scope 
of this writing to do justice and present an exhaustive survey of previous work. We refer the reader to 
directly related papers |[T9l [161 [H [5j [HI O Ell [TOJ, [22J and to the references therein for a more compre- 
hensive review of the literature. 

2 The Classification Process 

Prior to describing the specifics of our approach in detail, which will inevitably involve delving into tech- 
nicalities, it would be instructive to concentrate on the bigger picture. For this purpose, we begin by 
pointing out that the performance guarantee of 0(log m/ log log m) is obtained by employing the classify- 
and-select paradigm. More specifically, we exploit various structural properties to partition the collection 
of customers into 0(logra/ log log ra) pairwise-disjoint classes. For each such class, given the additional 
structure imposed, we separately compute a pricing scheme whose overall revenue comes within a con- 
stant factor of the optimal revenue attainable from this class. In particular, each class is treated in a 
completely independent fashion, as if there are no other classes under consideration. Consequently, since 
the objective function is subadditive, the above-mentioned approximation ratio follows by picking, out of 
the set of all pricing schemes computed, the one that collects maximal revenue. We proceed by describing 
the customer classification process; this exposition will allow us to focus attention on the more involved 
single-class problem later on. 
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2.1 Classifying customers via balanced decompositions 

In the following, we give a formal account of the process by which customers are partitioned into classes. 
To this end, we begin by introducing the notion of an almost-balanced decomposition, which can be 
viewed as a generalization of the well-known centroid decomposition ITT71 . It is worth noting that struc- 
tural properties in this spirit have been explored and exploited in various settings (see, e.g., |[T5ll20ll25l ). 

Definition 2.1. Let T - (V, E) be a tree. An almost balanced k-decomposition of T is a partition of T into 
k edge-disjoint subtrees T\,...,Tk such that each subtree contains between \E\/(3k) and 3\E\/k edges. 

For ease of presentation, we defer the proof of the following lemma to Appendix lA.il 

Lemma 2.2. Let T — (V, E) be a tree with \E\ > k. An almost balanced k-decomposition of T exists and 
can be found in polynomial time. Moreover, the number of vertices that are shared by at least two subtrees 
is less than k. 

The classification process corresponds to a recursive decomposition of the input tree T; to better 
understand the upcoming discussion, we advise the reader to consult Figure [T] Let T\ = {T\, . . . , 7\} 
be an almost balanced ^-decomposition of T into k edge-disjoint subtrees. The first class of customers, 
C\, consists of all customers separated by Ti, that is, customers / for which the endpoints of the desired 
path Pi (henceforth, s^ and ?;) reside in different subtrees of the decomposition T\ . Now, to classify the 
remaining set of customers, C \ C\, we recursively apply the previously-described procedure with respect 
to the collection of subtrees in T\ . Specifically, in the second level of the recursion, an almost balanced 
^-decomposition is computed in each of the subtrees T\,.,.,Tk, to obtain a set T%, comprising of k 2 
subtrees. The second class of customers, C%, consists of all yet-unclassified customers separated by 7*2- 
In other words, the endpoints of each path P,, for which i e C2, reside in different subtrees of T%, but in 
the same subtree of T\. The remaining classes C3, C4, ... are defined similarly. It is important to note that 
the recursive process ends as soon as we arrive at a subtree with strictly less than k edges. In this case, we 
make use of the trivial decomposition, where the given subtree is broken into individual edges. 



/ti\ 




Figure 1 : A schematic example for the collection of customers separated by an almost balanced decom- 
position, where each triangle marks a single subtree. Here, customers 1, 2, and 3 are separated, whereas 4 
and 5 are not. 

It is quite obvious that, up until this point in time, k was treated as a parameter whose value has 
not been determined yet. For our purposes, we employ the above-mentioned classification process with 
k - \\og l l 2 m\. With this value of k at hand3, one can easily verify that the overall number of levels in 

'A careful inspection later on shows that this particular selection is rather arbitrary, and any fixed exponent smaller than 1 
will do the trick. 
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the recursion, or equivalently, the number of customer classes is 0(log k m) = 0(logm/loglogm). This 
claim is immediately implied by observing that the maximum size of a subtree in level £ of the recursion 
is at most (3/k) c ■ \E\. As a side note, we remark that the balanced decomposition property, ensuring that 
all subtrees are of size roughly \E\/k, does not play any role from this point on, as its sole purpose was to 
restrict the number of classes to 0(log m/ log log m). 

2.2 Why handling a single decomposition is sufficient 

We remind the reader that a class of customers, say Q, generally consists of several subsets of customers, 
each created when different subtrees in 77- 1 are partitioned by the decomposition 77- More specifically, 
assuming that the subtrees in 77-i are T\, Ti, . . ., the class Q can be written as the disjoint union of 
C\,C 2 r ..., where C J e is the set of customers that are first separated when Tj is partitioned. Notice that the 
desired path of any customer separated by some subtree decomposition must be contained in that subtree, 
since otherwise, this customer would have been separated in previous recursion steps. This observation 
implies that it is sufficient to compute an approximate pricing scheme for a single subtree decomposition 
and its induced set of separated customers. Given a polynomial-time algorithm that computes such pricing 
schemes, one can sequentially apply it to each of the subtree decompositions in the same recursion level. 
The resulting pricing schemes (in edge-disjoint subtrees) can then be "glued" to form a single scheme, 
defined for the entire edge set, collecting at least as much revenue as the sum of all individual subtree 
revenues. 

3 The Single Decomposition Algorithm 

In what follows, we focus our attention on a single decomposition, and devise a randomized algorithm 
that computes a pricing scheme whose expected revenue is within a constant factor of the optimal revenue 
attainable for this decomposition. Later on, we will argue that this algorithm can be easily derandomized. 
Formally, an instance of the problem in question consists of a tree T = (V, E), and a partition 7" = 
{T\, . , . ,Tk} of this tree into k edge-disjoint subtrees, where k < flog ' 2 ml, such that the number of 
vertices shared by at least two subtrees is less than k. In addition, we are given a collection C of n 
customers, satisfying the following properties: 

1. Each customer i wishes to purchase a path P, so long as the overall price of this path does not exceed 
the budget b{. 

2. Each customer path P, is separated by 7", meaning that the endpoints of P, reside in different 
subtrees of the decomposition 7". 

3.1 Notation and Terminology 

For ease of presentation, it would be convenient to introduce some notation and terminology before laying 
down the nuts and bolts of our algorithm. To better understand the suggested notation, we refer the reader 
to a concrete example in Figure |2 

• Let Vb £ V be the set of border vertices of 7", that is, the set of vertices that are shared by at 
least two subtrees in 7". In addition, let S c T be the skeleton of 7", namely, the minimal subtree 
spanned by all border vertices. Note that this subtree consists of the union of paths connecting any 
two vertices in Vb- 

• Let p* : E — > R.+ be an optimal pricing scheme, with an overall revenue of OPT. 

• Now, recall that the endpoints of each customer path P, reside in different subtrees of the decom- 
position T, meaning that P, must traverse at least one border vertex. Therefore, we can divide each 
customer path, with endpoints Sj and tj, into three (possibly empty) parts: 

1. A subpath between s; and its closest skeleton vertex v,,.. 
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2. A subpath between and its closest skeleton vertex v t . . 

3. A subpath between v Sj and v tj , along the skeleton. 

Based on this definition, let R s p ,(i), R T p ,{i), and R^,(i) denote the revenues obtained in the pricing 
scheme p* from the subpaths of customer i, respective!}!!. Clearly, R p *(i) = R p ,(i) + Rp*(i) + Rpi(i)- 




Figure 2: (a) An almost balanced 6-decomposition of a tree with 18 edges. Note that the black vertices 
are border vertices, and the heavy edges make up the skeleton S. (b) Dividing the customer path P; into 
three parts. 

3.2 The algorithm 

Having all necessary definitions in place, we are now ready to present the specifics of our pricing algo- 
rithm, and to analyze its performance. In the remainder of this section, we consider two complementing 
scenarios, depending on the contribution of different path parts to the revenue generated by the optimal 
pricing scheme p* . Somewhat informally, the first scenario captures a situation where a significant por- 
tion of OPT is gained from customer subpaths traversing non-skeleton edges, or in other words, when 
Z"=i(^p*(0 + Rp*(ff) = £2(1) • OPT. The second scenario corresponds to a situation where a large portion 
is delivered by subpaths along the skeleton, that is, when £" =1 Rti(i) = £2(1) ■ OPT. For each of these sce- 
narios, we compute a pricing scheme whose expected revenue is within a constant factor of optimal. As a 
result, we obtain a constant approximation ratio by computing both pricing schemes and picking the one 
that achieves the maximum revenue. For sake of simplicity, we begin by considering the easy-to-handle 
first scenario, noting that the second, more challenging scenario, will be discussed in the sequel. 

Scenario I: E" =1 (/^,(0 + R T ,(i)) > OPT/2 

In the present setting, at least half of the optimal revenue is collected from customer subpaths consisting of 
non-skeleton edges, meaning that the collective contribution of the subpaths Pj\S, over all customers i, is 
at least OPT/2. The algorithmic tool that allows us to handle this scenario is a polynomial-time procedure, 
due to Gurus wami et al. E3l . for solving the single-source tollbooth problem on trees to optimality. Here, 
the underlying assumption is that all customer paths share a common endpoint, that is, si = • • • = s n . With 
this tool at hand, the algorithm proceeds as follows: 

1. Each of the decomposition subtrees, T\, . . . , 7\, is randomly and independently marked as being 
active, with probability 1 /2; otherwise, that tree is inactive. 

2. We decide in advance that all skeleton edges, as well as all edges within inactive trees, could be 
purchased free of charge, i.e., their price is set to zero. 

2 These superscripts stand for: 5 - subpath adjacent to j,-; T - subpath adjacent to t,\ and M - middle subpath. 
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3. Let C{Tj) denote the subset of customers with an endpoint in Tj. For each active subtree Tj, we 
contract all skeleton edges in that subtree into a single roojl, designated by r, and apply the single- 
source algorithm, when the underlying set of customers are those in C(Tj). Each such customer / is 
now interested in purchasing the path connecting its endpoint in Tj to the root r, still with a budget 
of bi. Note that each customer has at most one endpoint in Tj, since otherwise, Tj contains her 
desired path, in contradiction to the underlying setting. 

Lemma 3.1. The computed pricing scheme guarantees an expected revenue of at least OPT/8. 

Proof. We begin by observing that the expression E?-i(P„»(0 + ^I«(0)> which stands for the overall 

lip p 

revenue in the optimal pricing scheme obtained from non-skeleton edges (over all customers), can be 
rewritten as the sum of revenues obtained from the subtrees Pi, . . . ,7\. That is, recollecting that C(Tj) 
denotes the subset of customers with an endpoint in Tj, we have 

n k 

£(j$(0+*J.(0) = 2 £ R,A(P<\S)nTj), 

i=l y=l ieC(Tj) 

where \ S) n Tj) denotes the revenue obtained from the subpath of P, within the subtree Tj, 

excluding skeleton edges. 

Now, for each active subtree Tj, one can easily verify that the pricing scheme p* , restricted to the 
edges of Tj (or, more precisely, to its skeleton-contracted version), constitutes a feasible solution to the 
corresponding single-source problem, with an objective value of at least Z; e c(r ; )^p*((P' \ n Tj). Since 
for this particular problem we compute an optimal pricing scheme, the revenue collected from customers 
in C(Tj), restricted to the subtree Tj, is at least that value. At this point in time, we observe that, for each 
such customer, with probability 1/2 we will be able to stay within the budget h\, and collect exactly the 
same profit for the entire path Pi, with additional edges outside of Tj. To establish this claim, since the 
price of all skeleton edges has been set to zero, it is sufficient to note that the subpath of P; that does not 
traverse edges either in the subtree Tj or the skeleton S, i.e., Pj \ (Tj U S), is entirely contained within a 
subtree different from Tj; that subtree will be marked as inactive with probability 1/2, and its edge prices 
will be set to zero. It follows that the expected revenue of the pricing scheme computed is at least 

}Z Z V((^A5)nr,) = 1^(^,(0 + ^,(/))>2p:. 

7=1 ieC(Tj) i=l 



Scenario II: £" , R M (i) > OPT/2 
i=i p* 

In this setting, at least half of the optimal revenue is collected from customer subpaths consisting of 
skeleton edges, meaning that the collective contribution of the subpaths P; n <S, over all customers i, is at 
least OPT/2. Our algorithm begins by first deciding that all non-skeleton edges could be purchased for 
free, and sets their price to zero. As a result, we may assume that the endpoints of each customer path are 
located on the skeleton, since otherwise, we can relocate them to their closest skeleton vertices, without 
any consequences whatsoever. With this structural alteration in mind, the skeleton pricing is carried out in 
two phases: segment guessing, where close estimates of the optimal prices along disjoint subpaths of the 
skeleton are obtained, followed by randomized assignment, where prices are associated with individual 
skeleton edges. 

Phase I: segment guessing. We remind the reader that the skeleton S is the minimal subtree of T 
spanned by all border vertices Vg. We denote by Vj the set of junction vertices, defined as non-border 
skeleton vertices with degree at least 3 (counting only skeleton edges). One can easily verify that \Vj\ < 

3 The contraction is achieved by removing all skeleton edges, and unifying their endpoints into a representative vertex. 
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|Vg| < k. In addition, we call the vertex set Vb U Vj the core of the skeleton S. Based on these definitions, 
we can partition the skeleton into a collection 2,(S) of edge-disjoint paths, which are referred to as seg- 
ments. Each such segment is a subpath of S whose endpoints are core vertices, but its interior traverses 
only non-core vertices. Obviously, |X(«S)| = \Vb\ + \Vj\ - 1 < 2k. 

In what follows, we argue that one could obtain in polynomial time a close estimate for the total price 
p*(cr) = Yieea- P*( e ) °f eacn segment cr e X(«S), simultaneously for all segments. To this end, it is sufficient 
to prove that there exists a very small set of prices T, and a corresponding pricing scheme pr : E — > R+, 
such that the price given by pr to every segment falls within T, namely, pr(o~) € T for each cr e £(*S), and 
such that pr(o~) approximates p*(cr) well. At the same time, we would like to make sure that the overall 
revenue that the supporting pricing scheme pr collects from customer subpaths consisting of skeleton 
edges still forms a fixed portion of YJl=\ ^(0- A construction of this nature is given in the next lemma, 
whose proof appears in Appendix IA.2I 

Lemma 3.2. Let Y = {2 e ■ b max /(4nm) : < £ < Llog(4«m 2 )J( U {0}, where b max = max,- bu Then, there is 
a pricing scheme pr : E — > R + that satisfies the following properties: 

1. pr(o~) e r for each cr e £(<S). 

By Lemma [3T2l we conclude that to obtain a close estimate for the total price p*(cr) of each segment 
cr e E(«S), simultaneously for all segments, the total number of price assignments to be examined is of 
polynomial size, since 

prf(S)l = (0(i g(nm))) OW = (0(log(nm))) 0(logW ' m) = o(nm) . 

Consequently, we may assume without loss of generality that the set of segment prices {pr(o~) : cr e X(«S)}, 
given in Lemma l3~l2l is known in advance. This assumption can be easily enforced by enumerating over all 
oinm) possible assignments. On the other hand, it is worth noting that we do not assume any knowledge 
of the edge-specific pricing pr : E — > R.+. 

Phase II: randomized assignment. The goal of this phase is to complete the pricing scheme by assign- 
ing carefully-picked random prices to individual skeleton edges, based on the estimated segment prices 
{pr(o") : or € S(«S)}. The crux lies in making sure that the edge-specific prices always respect the outcome 
of the segment guessing phase, as stated in the next invariant. 

Invariant 3.3. With probability 1, the total price of each segment cr e X(«S) is exactly pr(o~). 

Our assignment procedure consists of independent steps, where segments are processed one after the 
other. In each step, we consider a skeleton segment cr = (v\, . . . ,V[), and assign prices to its edges 
(vi , V2), - - - , (vf_i , V{ ) in a way that satisfies Invariant 13.31 Specifically, as illustrated in Figure Ha), we 
pick one of the following four price assignments uniformly at random: 

1. Assign a price of pr(o~) to the edge (vi, V2), and zero prices to the remaining edges. 

2. Assign a price of pr(cr) to the edge (yt-\, vi), and zero prices to the remaining edges. 

3. Assign prices based on a vi -rooted single-source problem (see description below). 

4. Assign prices based on a v^-rooted single-source problem, analogous to item 3. 

To complete the description of our algorithm, it remains to explain how the single-source instances in 
items 3 and 4 are created and solved; for brevity of presentation, we focus on the vi -rooted case, noting 
that the opposite case is identical, up to changing the roles of vi and Vf. Once again, we will employ 
the dynamic-programming algorithm of Guruswami et al. E3l for solving the single-source tollbooth 
problem on trees, a valuable tool that was introduced in Scenario I. In particular, the vi-rooted instance is 
comprised of the following components: 
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• The underlying graph is the segment cr. 

• The set of customers are those with an endpoint in [v%, , . . , V(-\}, whose desired path exits the 
segment cr through vi . 

• For each customer i under consideration, we set up a new endpoint at vi instead of the one outside 
cr, and change her budget to mm{pY{cr), bj - Hs-io-cp, Pr(cr)}- Note that the latter term is exactly the 
budget remaining to purchase P,, assuming that a total price of pr(cr) has already been paid for each 
segment <x that is fully-contained in P;. 

Needless to say, the single-source algorithm does not set a price for the edge (yt-\, V{), as it is not contained 
in any desired path of the v\ -rooted instance constructed. Therefore, to ensure that Invariant 13.31 holds, 
we set the price of (v(-i,V[) to be the difference between the total segment price pr(cr) and the total 
newly-computed price of (vi, V2), . . . , (v^_2,v^_i). Here, it is important to point out that this difference is 
indeed non-negative, since the dynamic -programming algorithm guarantees that the price of each subpath 
(vi, . . . ,Vj) is equal to the budget of some customer whose desired path is contained in (vi, . . . ,vj) (see 
E3l Thm. 5.3]); on the other hand, the budget of every customer cannot exceed pr(o~), by definition. 

Vi V 2 V 3 Vt-2 Vl-i v e 



assignment 1: 



assignment 2: 



assignment 3: 



assignment 4: 




customers leaving through v t 



(a) (b) 

Figure 3: (a) A schematic description of the four random assignments for the segment cr. Here, core 
vertices are marked in black, (b) The new customer path partition. 



Analysis. The remainder of this section is devoted to proving that the expected revenue of the pricing 
scheme computed in the randomized assignment phase is within a constant factor of optimal, as formally 
stated in the following lemma. 

Lemma 3.4. The pricing scheme constructed in the randomized assignment phase guarantees an expected 
revenue of at least OPT/ 25 6. 

Recall that we have previously assumed the endpoints of each customer path P, to be located on the 
skeleton. Moreover, since these endpoints reside in different subtrees of the decomposition 7~, the path 
Pi must traverse at least one border vertex. For this reason, as shown in Figure EJb), we can divide each 
customer path, with endpoints Sj and into three (possibly empty) parts: 

1 . A subpath, along a partial segment, between Sj and its closest core vertex u Si . 

2. A subpath, along a partial segment, between t\ and its closest core vertex u tj . 

3. A subpath between u Sj and u tj , along a sequence of complete segments. 
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With these definitions in mind, let R s pr (i), Rj, r (i), and R^ r (i) denote the revenues obtained in the esti- 
mated pricing scheme pr from the subpaths of customer i, respectively. We remark that pj- is the pricing 
scheme whose existence has been established in Lemma I3T21 Note that, by item 2 of this lemma, we can 
bound the sum of the above-mentioned revenues, over all customers, in terms of OPT, since 

£ «(o + j£® + <(o) = z > \ £ <(o > ^ . 

!=1 1=1 1=1 

In what follows, we consider two cases, depending on the contribution of different path parts to the revenue 
generated by the pricing scheme pr- For each of these cases, we show that the randomized assignment 
phase computes a pricing scheme whose expected revenue is within a constant factor of optimal. 

Case I: ^ OPT/16. In this setting, a significant fraction of the optimal revenue is collected 

from customer subpaths consisting of complete segments, meaning that the collective contribution of the 
paths u Sj w u t ., over all customers i, is at least OPT/ 16. Notice that, by Invariant l3.3l our pricing scheme 
always assigns a total price of pr(o~) to each segment cr e S(»S), implying that the overall price of u Sj w u t . 
is exactly R^ r (i). Therefore, the revenue from customer i is at least R^ r {i), unless the total price of the 
remaining partial segments w u Si and t\ w u tj exceeds the residual budget, bj - R^.(i). We next 
argue that, with probability at least 1/16, all edges on these partial segments will be given zero prices. 
Consequently, the expected revenue from customer i is at least R^ r (i)/ 16, and by linearity of expectation, 

the overall expected revenue is no less than £" =1 7?^(/)/16 > OPT/256. 

For the purpose of establishing the previously mentioned claim, we will show that, with probability at 
least 1 /A, each and every edge on si w u s . is assigned a zero price. The claim then follows by observing 
that an identical property with respect to t\ w u t . can be proven along the same lines, and also, that the 
two events are independent. Now, notice that if Sj is a core vertex then the claim trivially holds, since 
s; w u Si is empty. Hence, let us consider the case where Sj is an internal vertex on the segment between 
the core vertices u s . and u Sj . In this case, with probability 1/4, our algorithm assigns a price of zero to all 
edges in this segment, except for the edge adjacent to u s .; in particular, all edges on Sj w u Sj are assigned 
zero prices. 

Case II: E" =1 (*1,(0 + R l r (W> ^ OPT/16. Here, a constant fraction of the optimal revenue is collected 
from customer subpaths consisting of partial segments, meaning that the collective contribution of the 
paths si w u Sj and w u tj , over all customers i, is at least OPT/ 16. Consider some segment cr = 
(vi, . . . ,ve), and let C(cr) denote the collection of customers that have one of their path endpoints in 
{v2, . . . ,V(-\}. Clearly, if we use Rp T (i) to denote the revenue obtained in the pricing scheme pr from 
customer i along the segment cr, then 

n 

<reS(5) ieC(cr) i=l 

Furthermore, let C L (cr) and C R (cr) be the sets of customers in C(o~) whose desired path exits the segment 
cr through v\ and vt, respectively. Obviously, C(cr) = C L (cr) U C R (cr). 

Recall that, with probability 1 /A, our algorithm computes an optimal pricing scheme pi for a vi -rooted 
single-source problem on the segment cr. In particular, it is not difficult to verify that the underlying set 
of customers would be C L {cr), with paths restricted to the segment cr, and moreover, that each customer i 
will be associated with a newly-defined budget of min{pr(o")> bj - Yjo-.o-qp, Pr(<5")l- Notice that the pricing 
scheme py forms a feasible solution to this instance, implying that if Rp L (j) denotes the overall revenue 
from customer i along the segment cr with respect to pi, we have 

Z *£® * Z + Z ^r© • 

ieC L {cr) ieC L (o-y. ieC L (jr): 

Sj€cr tj€cr 
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Similarly, with probability 1 /4, we compute an optimal pricing scheme pn for a v^-rooted single-source 
problem, and using almost identical arguments, obtain the inequality 



ieCV) 



E 

eC^to-): 



z 

eC R (cr): 
t;£(r 



We proceed by showing that for every productive customer i e C(cr), with probability at least 1 /4, the 
total price that will be assigned to the path Pj, excluding edges residing within the segment cr, is exactly 
Yjo-.a-QPi Pr(<7")- In this case, the (global) budget bj is not exceeded, since customer i was given a budget 
of no more than h, - Z]<x:a-cp, Pv(d") in the single-source problem on cr. Prior to proving this claim, we 
remark that it is indeed sufficient to conclude Case II, since the overall expected revenue would be at least 



r 6 Z Z R p^ + Z w 

o-e£(5) Vi'eC L (o-) ieCV) 



> 



- y 

16 A, 



creS(S) 



Z Z <® + Z Z 4© 



eC L (o-): 



/eC L (o-): 



teC fi (<r): 



= ^ Z Z 



o-eS(5) ;eC(o-) 



= 17Z(^ + <r(0) 



> 



16 * 
OPT 



256 



To prove the last claim, consider some particular customer i e C(cr), and without loss of generality, 
suppose that the endpoint of Pi that resides within the segment cr is Sj. We therefore focus on bounding the 
total price of the subpath connecting u Sj and tj, which can be broken into two probabilistically-independent 
parts: 

• A subpath between u Si and u ti , along a sequence of complete segments. By Invariant 13.31 with 
probability 1, the total price of each complete segment cr is pr(<5")> immediately implying that the 
total price of u Sj u tj is Z<f :<tcp, Pr(<5")- 

• A subpath, along a partial segment (different from cr), between t[ and u tj . Here, with probability at 
least 1 1 A, each and every edge on tj u tj is assigned a zero price. The arguments for proving this 
case are identical to those in Case I, and we do not repeat them to avoid duplicity. 



3.3 Derandomization 

The avid reader may already have noticed that the extent to which we utilize randomization is rather 
limited, and that its foremost purpose is to make the presentation of our algorithm significantly simpler. 
More specifically, in Scenario I each subtree in the decomposition is randomly marked as being active 
or inactive, whereas in Scenario II one of four possible price assignments is picked at random for each 
segment. In other words, all we need to obtain a deterministic algorithm are two uniform sample spaces, 
with 0(1) possible values for 0(log^ 2 m) random variables. These can be constructed in polynomial 
time either explicitly, as there are only 0(m°^') outcomes to examine, or in a more compact way, by 
observing that nothing more than pairwise-independence (see, for instance, El Chap. 15]) is required for 
the preceding analysis. 
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A Additional Proofs 
A.l Proof of Lemma IXH 

We begin by presenting a well-known result regarding centroid decompositions in trees. 

Definition A.l. Let T = (V, E) be a tree. A centroid decomposition of T is a partition of T into two edge- 
disjoint subtrees (sharing a common vertex) such that each subtree contains between |2s|/3 and 2|£"|/3 
edges. 

Theorem A.2. ( II17II ) Let T - (V, E) be a tree with \E\ > 2. A centroid decomposition ofT exists and can 
be found in linear time. 

We now propose an iterative process for generating an almost balanced ^-decomposition of T in 
polynomial time. This process consists of k - 1 steps, where in each step a centroid decomposition is 
applied to a subtree having maximal number of edges. That is, in the first step, a centroid decomposition 
is applied to T and results in subtrees T\ and T2; in the second step, a centroid decomposition is applied 
to the subtree with maximal number of edges (out of T\ and T2); so forth and so on. Clearly, this process 
can be implemented to run in polynomial time, as implied by Theorem lA.2l 

We argue that the collection of k subtrees resulting from this process is indeed an almost balanced 
^-decomposition, noting in advance that the number of vertices shared by at least two subtrees is less than 
k. For this purpose, we are left to prove that each subtree contains between \E\/(3k) and 3\E\/k edges. We 
next claim that in each step, the number of edges in the maximal subtree, i.e., the subtree that has maximal 
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number of edges, is no more than 3 times greater than the number of edges in the minimal subtree, i.e., 
the subtree with minimal number of edges. This claim implies that, at the end of the process, each subtree 
must have at least \E\/(3k) edges. Otherwise, it follows that all k subtrees must have less than \E\/k edges, 
and thus, the decomposition consists of less than \E\ edges. By employing similar arguments, one can 
show that each subtree has no more than 3\E\/k edges. 

We turn to prove the above-mentioned claim by induction on the number of steps performed. At 
the beginning of the process, the claim trivially holds. Now, suppose the claim holds at the beginning 
of some step. Namely, the maximal subtree has r max edges, the minimal subtree has r m j n edges, and 
'max/'min ^ 3. Recall that the centroid decomposition partitions a subtree with maximal number of edges 
into two subtrees that contain between r max /3 and 2r max /3 edges. This implies that at the beginning of 
the following step, the maximal subtree has no more than r max edges, while the minimal subtree has at 
least min{r m i n , r max /3} edges. Notice that in either case, the size ratio between the maximal and minimal 
subtrees is at most 3. 

A.2 Proof of Lemma S3] 

The general idea behind our proof is to perform a sequence of modifications to the optimal pricing scheme 
p*, trying to arrive at a new scheme pr that satisfies the required properties. For this purpose, we begin 
by observing that, without loss of generality, p*(e) < b max for every edge e e E; otherwise, by setting 
p*(e) = b max , the overall revenue may only improve. We now separately consider each segment cr e X(<S), 
and proceed as follows: 

1. If p*(<r) < b max /(4nm), we set the price of every edge in cr to zero. 

2. If bmax/(4nm) < p*(o~) < mb max , we uniformly scale down the prices of all edges in cr such that the 
newly-defined price of cr will be equal to the maximal value y e T such that y < p*(cr). For this 
purpose, the scaling factor is simply yj p*(cr) > 1 /2, where the last inequality holds since T consists 
of a geometric sequence between b max /(4nm) and mb max , with a multiplier of 2. 

This construction clearly satisfies the first property, and it remains to prove £" =1 ^£(0 ^ S"=i ^£(0/4. 
To validate the last inequality, note that our modifications may lead to two types of revenue losses: additive 
(due to item 1) and multiplicative (due to item 2). The additive loss can be easily bounded by observing 
that each edge appears at most once on any customer path and that the skeleton consists of at most m 
edges. Therefore, the entire contribution of edges that were modified in the first item can be bounded by 

bmax b max OPT 1 V dM,-n 

Anm 4 4 2 ^ p 

where the first inequality holds since OPT > b max , as an overall revenue of b max can obviously be attained. 
In addition, the multiplicative loss can be bounded by noting that the scaling factor in the second item is 
at least 1/2. It follows that £" =1 R*(f) > £- = i R%(i)/4. 
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